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Compositions and methods for producing high yields of heterologous 

polypeptides in a Pichia cell 


Field of invention 

5 The high yield production of certain polypeptides is of industrial importance, 

thus compositions and methods for producing high yields of heterologous polypeptides 
in a Pichia cell such as Pichia methanolica are of interest. 

Background of the invention 

10 Yeast cells are potential candidates for host organisms for heterologous protein 

production. To use Saccharomyces cerevisiae for the production of e.g. pharmaceutical 
products or industrial enzymes is well known. It has been suggested that methylotrophic 
yeasts that can utilize methanol as a sole carbon and energy source are more attractive 
host strains for heterologous protein production. High protein productivity has been 
15 achieved with strongly inducible promoters, such as the one from alcohol oxidase or 
formate dehydrogenase, by performing high cell density fermentations in which the dried 
cell mass exceeds 100 g/kg. Examples of high expression of heterologous protein are: 
2.5 g/l oi S. cerevisiae invertase (Tschopp et. al. Bio/Technology, vol 5, 1305-1308, 1987); 
3.4 g/l of human serum albumin in Pichia pastoris (EP 510693); 3.4g/I of glucoamylase 
20 from Rhizopus oryzae in Candida boidinii (Sakai et.al, Biochem. Biophys. Acta, 1308, 
81-87,1996); 1.4g/l of Schawanniomyces occidentalis glucoamylase (Gelissen et.al, 
Bio/Technology, vol 9, 291-295, 1991); and 7.2g/l of Aspergillus phytase in Hansenula 
polymorpha (EP911416). In WO 99/67398 an Aspergillus niger phytase is expressed in 
^ and secreted from Pichia pastoris by using a fusion .with the . signal peptide_oL yeast. 
25 a-f actor. 

Pichia methanolica is another methylotrophic yeast that has been shown 
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recently to produce human glutamate decarboxylase at the level of 0.5 g/l intracellular^ 
< Yeast 1^ 11-23. 1998). It has been stated that P. methanolica could be used for 
projut.-ti-j" neterologous protein, but no example of productivity or yield over 1 

gram pc litre nas been shown yet (US 5,716,808; WO 97/17450). 

5 

Summary of the invention 

Th»- problem to be solved by this invention is to provide expression systems and 
method fur policing a polypeptide of interest in high yields. 

r 

Th- so jtion is based on that the present inventors have identified compositions 
10 and methods for high yield production of a heterologous polypeptide of interest in a 
Pichia coil specifically Pichia methanolica. By adjusting the codon usage of a gene 
encoding a polypeptide of interest to the codon usage of one or more gene(s) from the 
wild type P methanolica, the inventors have shown that protein productivity surprisingly 
can be increased with as much as a factor of 10 or more. 
15 In non-limiting example herein heterologous polypeptides were expressed from 

coddn-optimized genes with original secretion signal sequences and secreted from P. 
methanolica. 

In the industrial production of polypeptides, a gene encoding a polypeptide of 
interest is most often present within the production host cell in multiple copies which are 

20 all expressed in order to achieve an optimally high yield of the polypeptide. Gene 
analysis by Southern blotting of a production cell of the invention revealed that the high 
polypeptide yield was achieved with a very low gene dosage of only approx. 2 copies pr. 
cell, which suggests a potential for increasing the yield of the polypeptide of interest even 
further by increasing the copy number. 

25 Accordingly in a first aspect, the pres ent inve n tion r elates to a Pichia cell 

comprising at least one copy of a heterologous nucleotide sequence encoding a 
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polypeptide of interest, wherein the codon usage of said sequence has been adjusted to 
match the preferred codon usage of P. methanolica, as defined herein. 

The preferred codon usage of P methanolica as relates to this invention is 
presented in Example 2, Table 1, e.g. as the codon-ratios used in the optimized phytase 
5 gene sequence for each amino acid. More specifically the preferred codons for Lysine 
are TTA or TTG; for Isoleucine they are ATT or ATC; for Valine they are GTT or less 
preferred GTC; for Serine they are TCT, TCC or TCA; for Proline they are CCA or less 
preferred CCT; for Threonine they are ACT or less preferred ACC; for Alanine they are 
GCT or GCC; for Tyrosine the more preferred is TAC; for Glutamine it is CAA; for 
10 Asparagine it is AAC; for Glutamic acid it is GAA; for Cysteine it is TGT; for Arginine it is 
AGA; and finally for Glycine it is GGT. 

The codon usage of a DNA sequence encoding a polypeptide is adjusted to 
match the preferred codon usage of P methanolica when at least 50% of the codons for 
specific amino acids in the polypeptide sequence have been adjusted to a preferred 
15 codon as defined above, or preferably at least 60%, more preferably at least 70%, even 
more preferably 80%, yet more preferably 90%, and most preferably 95% of the codons 
for specific amino acids in the polypeptide sequence have been adjusted to a preferred 
codon as defined above. 

Further in a second aspect, the present invention relates to an isolated DNA 
20 construct comprising at least one copy of a nucleotide sequence encoding a polypeptide 
heterologous to Pichia methanolica, wherein the codon usage of said sequence has 
been adjusted to match the preferred codon usage of P. methanolica, as defined herein. 

Finally in a third aspect the present invention relates to a method of producing a 
polypeptide of interest in a Pichia cell, where the polypeptide is encoded by a nucleotide 
25 ^sequence heterologous to P methanolica, said method comprising the steps of: 

a) adjusting the codon usage of the sequence to match the preferred codon usage of P 
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methanolica, as defined herein; and 
b) cultivating a Pichia cell comprising at least one copy of the codon usage adjusted 
sequence of step a) under appropriate growth conditions to express the sequence 
and achieve production of the polypeptide of interest. 

5 

Deposited microorganisms 

The entire sequence of the synthetic codon optimized A. fumigatus phytase 
gene (SEQ ID No. 19) is comprised in the strain E. coli NN049526, which was deposited 
with "Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH" (DSMZ) 
1 0 01 -March-2000 with the accession number DSM 1 3352. 

The cDNA sequence of the Thermomyces lanuginosus DSM4109 xylanase 
gene was published in WO 96/32472. For this study we obtained a Saccharomyces 
cerevisiae strain from DSMZ, accession number DSM 10133, deposited on 
19-July-1995, which contains a plasmid comprising the full-length xylanase cDNA of 
15 the Thermomyces lanuginosus DSM4 109. 

The deposits were made by Novo Nordisk A/S and were later assigned to 
Novozymes A/S. 

Definitions 

20 The term "isolated nucleic acid sequence" as used herein refers to a nucleic 

acid sequence which is essentially free of other nucleic acid sequences, e.g., at least 
about 20% pure, preferably at least about 40% pure, more preferably at least about 
60% pure, even more preferably at least about 80% pure, and most preferably at least 
about 90% pure as determined by agarose electrophoresis. 

25 For example, an isolated nucleic acid sequence can be obtained by standard 

cloning procedures used in genetic engineering to relocate the nucleic acid sequence 
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from its natural location to a different site where it will be reproduced. The cloning 
procedures may involve excision and isolation of a desired nucleic acid fragment 
comprismy the nucleic acid sequence encoding the polypeptide, insertion of the 
fragmt-nt mto a vector molecule, and incorporation of the recombinant vector into a 
host ceN where multiple copies or clones of the nucleic acid sequence will be 
replicated 

In.* nucleic acid sequences of the present invention may be of genomic, 
cDNA RNA. scmisyntnetic, synthetic origin, or any combinations thereof and may be 
obtained from microorganisms of any genus. For purposes of the present invention, 
the term 'obtained from" as used herein in connection with a given source shall mean 
that the polypeptide encoded by the nucleic acid sequence is produced by the source 
or by a cell in which the nucleic acid sequence from the source has been inserted. 

The nucleic acid sequences may be obtained from a bacterial source. For 
example, these polypeptides may be obtained from a gram positive bacterium such as 
a Bacillus strain, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, 
Bacillus circulans, Bacillus coagulans, Bacillus lautus f Bacillus lentus, Bacillus 
licheniformis. Bacillus megaterium, Bacillus stearothermophilus, Bacillus subtilis, or 
Bacillus thuringiensis: or a Streptomyces strain, e.g., Streptomyces lividans or 
Streptomyces murinus; or from a gram negative bacterium, e.g., E. coli or 
Pseudomonas sp. 

The nucleic acid sequences may be obtained from a fungal source, and more 
preferably from a yeast strain such as a Candida, Hansenula, Kluyveromyces, Pichia, 
Saccharomyces, Schizosaccharomyces, or Yarrowia strain; or more preferably from a 
filamentous fungal strain such as an Acremonium, Aspergillus, Aureobasidium, 

Cryptococcus, Filibasidium 1 —-Fusarium, _ Humicola, _ Magnaporthe, . Mucor, 

Myceliophthora, Neocaliimastix, Neurospora, Paecilomyces, Penicillium, Piromyces, 
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Schizophyllum, Talaromyces. Thermoascus. Thielavia. Tolypocladium, or Trichoderma 
strain. 

The nucleic acid sequences may thus be obtained from Saccharomyces 
carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, 
5 Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis or 
Saccharomyces oviformis: or indeed from from an Aspergillus aculeatus, Aspergillus 
awamori, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus 
niger, Aspergillus oryzae, Fusarium bactridioides, Fusarium cerealis, Fusarium 
crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, 

in Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium 
reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, 
Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium 
trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, 
Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium 

15 purpurogenum, Trichoderma harzianum, Trichoderma koningii, Trichoderma 
longibrachiatum, Trichoderma reesei, or Trichoderma viride strain. 

In a preferred embodiment, the nucleic acid sequence is the sequence 
encoding a phytase comprised in Escherichia coli DSM13352. In another preferred 
embodiment, the nucleic acid sequence is SEQ ID NO: 3 or SEQ ID NO: 19. 

20 It will be understood that for the aforementioned species, the invention 

encompasses both the perfect and imperfect states, and other taxonomic equivalents, 
e.g., anamorphs, regardless of the species name by which they are known. Those 
skilled in the art will readily recognize the identity of appropriate equivalents. Strains of 
these species are readily accessible to the public in a number of culture collections, 

25 such__as_the. American^Typa Culture Collection (ATCC), Deutsche ..Sammlung von 

Mikroorganismen und Zellkulturen GmbH (DSMZ), Centraalbureau Voor 
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Schimmelcultures (CBS), and Agricultural Research Service Patent Culture Collection, 
Northern Regional Research Center (NRRL). 

Furthermore, such nucleic acid sequences may be identified and obtained 
from other sources including microorganisms isolated from nature (e.g., soil, composts, 
5 water, etc.) using the above-mentioned probes. Techniques for isolating 
microorganisms from natural habitats are well known in the art. The nucleic acid 
sequence may then be derived by similarly screening a genomic or cDNA library of 
another microorganism. Once a nucleic acid sequence encoding a polypeptide has 
been detected with the probe(s), the sequence may be isolated or cloned by utilizing 
10 techniques which are known to those of ordinary skill in the art (see, e.g., Sambrook et 
a/., 1989, supra). 

The techniques used to isolate or clone a nucleic acid sequence encoding a 
polypeptide are known in the art and include isolation from genomic DNA, preparation 
from cDNA, or a combination thereof. The cloning of the nucleic acid sequences of 

15 the present invention from such genomic DNA can be effected, e.g., by using the 
well-known polymerase chain reaction (PGR) or antibody screening of expression 
libraries to detect cloned DNA fragments with shared structural features. See, e.g., 
Innis et a/., 1990, PCR: A Guide to Methods and Application, Academic Press, New 
York. Other nucleic acid amplification procedures such as ligase chain reaction (LCR), 

20 ligated activated transcription (LAT) and nucleic acid sequence-based amplification 
(NASBA) may be used. 

A nucleic acid sequence of the present invention may also encode fused 
polypeptides or cleavable fusion polypeptides in which another polypeptide is fused at 
the N-terminus or the C-terminus of the polypeptide or fragment thereof. A fused 

25 ppJypeptid_e_is_pro_duced_by..fusjn sequence (or a portion thereof) 

encoding another polypeptide to a nucleic acid sequence (or a portion thereof) of the 
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present invention. Techniques for producing fusion polypeptides are known in the art, 
and include ligating the coding sequences encoding the polypeptides so that they are 
in frame and that expression of the fused polypeptide is under control of the same 
promoter(s) and terminator. 
5 The present invention also relates to nucleic acid constructs comprising a 

nucleic acid sequence of the present invention operably linked to one or more control 
sequences which direct the expression of the coding sequence in a suitable host cell 
under conditions compatible with the control sequences. Expression will be 
understood to include any step involved in the production of the polypeptide including, 

10 but not limited to, transcription, post-transcriptional modification, translation, 
post-translationai modification, and secretion. 

"Nucleic acid construct" or "DNA construct" is defined herein as a nucleic acid 
molecule, either single- or double-stranded, which is isolated from a naturally 
occurring gene or which has been modified to contain segments of nucleic acid which 

15 are combined and juxtaposed in a manner which would not otherwise exist in nature. 
The term nucleic acid construct is synonymous with the term expression cassette 
when the nucleic acid construct contains all the control sequences required for 
expression of a coding sequence of the present invention. The term "coding 
sequence" is defined herein as a portion of a nucleic acid sequence which directly 

20 specifies the amino acid sequence of its protein product. The boundaries of the 
coding sequence are generally determined by a ribosome binding site (prokaryptes) or 
by the ATG start codon (eukaryotes) located just upstream of the open reading frame 
at the 5' end of the rnRNA and a transcription terminator sequence located just 
downstream of the open reading frame at the 3' end of the mRNA. A coding 

25 sequence can include, but is not limited to, DNA, cDNA, and recombinant nucleic acid 


sequences. 
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An isolated nucleic acid sequence encoding a polypeptide of the present 
invention may be manipulated in a variety of ways to provide for expression of the 
polypeptide. Manipulation of the nucleic acid sequence prior to its insertion into a 
vector may be desirable or necessary depending on the expression vector. The 
techniques for modifying nucleic acid sequences utilizing recombinant DNA methods 
are well known in the art. 

The term "control sequences" is defined herein to include all components 
which are necessary or advantageous for the expression of a polypeptide of the 
present invention. Each control sequence may be native or foreign to the nucleic acid 
sequence encoding the polypeptide. Such control sequences include, but are not 
limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal 
peptide sequence, and transcription terminator. At a minimum, the control sequences 
include a promoter, and transcriptional and translational stop signals. The control 
sequences may be provided with linkers for the purpose of introducing specific 
restriction sites facilitating ligation of the control sequences with the coding region of 
the nucleic acid sequence encoding a polypeptide. The term "operably linked" is 
defined herein as a configuration in which a control sequence is appropriately placed 
at a position relative to the coding sequence of the DNA sequence such that the 
control sequence directs the expression of a polypeptide. 

The control sequence may be an appropriate promoter sequence, a nucleic 
acid sequence which is recognized by a host cell for expression of the nucleic acid 
sequence. The promoter sequence contains transcriptional control sequences which 
mediate the expression of the polypeptide. The promoter may be any nucleic acid 
sequence which shows transcriptional activity in the host cell of choice including 
mutant, truncated, _and hybrid promoters, and may be obtained from genes en codin g 
extracellular or intracellular polypeptides either homologous or heterologous to the 


WO 01/66693 PCT/DK01/00154 

10 

host cell. 

Useful signal peptides for yeast host ceils are obtained from the genes for 
Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. 
Other useful signal peptide coding regions are described by Romanos et a/., 1992, 
5 Yeast 8: 423-488. 

It may also be desirable to add regulatory sequences which allow the 
regulation of the expression of the polypeptide relative to the growth of the host cell. 
Examples of regulatory systems are those which cause the expression of the gene to 
be turned on or off in response to a chemical or physical stimulus, including the 

10 presence of a regulatory compound. 

The present invention also relates to recombinant expression vectors 
comprising a nucleic acid sequence of the present invention, a promoter, and 
transcriptional and translational stop signals. The various nucleic acid and control 
sequences described above may be joined together to produce a recombinant 

15 expression vector which may include one or more convenient restriction sites to allow 
for insertion or substitution of the nucleic acid sequence encoding the polypeptide at 
such sites. Alternatively, the nucleic acid sequence of the present invention may be 
expressed by inserting the nucleic acid sequence or a nucleic acid construct 
comprising the sequence into an appropriate vector for expression. In creating the 

20 expression vector, the coding sequence is located in the vector so that the coding 
sequence is operably linked with the appropriate control sequences for expression. 

The recombinant expression vector may be any vector (e.g., a plasmid or 
virus) which can be conveniently subjected to recombinant DNA procedures and can 
bring about the expression of the nucleic acid sequence. The choice of the vector will 

25 typically depend on the compatibility of the vector with the host cell into which the 

vector is to be introduced. The vectors may be linear or closed circular plasmids. 
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The vector may be an autonomously replicating vector, i.e., a vector which 
exists as an extrachromosomal entity, the replication of which is independent of 
chromosomal replication, e.g., a plasmid, an extrachromosomal element, a 
mini-chromosome, or an artificial chromosome. The vector may contain any means 
5 for assuring self-replication. Alternatively, the vector may be one which, when 
introduced into the host cell, is integrated into the genome and replicated together with 
the chromosome(s) into which it has been integrated. Furthermore, a single vector or 
plasmid or two or more vectors or plasmids which together contain the total DNA to be 
introduced into the genome of the host cell, or a transposon may be used. 
10 The vectors of the present invention preferably contain one or more selectable 

markers which permit easy selection of transformed cells. A selectable marker is a 
gene the product of which provides for biocide or viral resistance, resistance to heavy 
metals, prototrophy to auxotrophs, and the like. 

Suitable markers for yeast host cells are ADE1 , ADE2, HIS3, LEU2, LYS2, MET3, 

15 TRP1 , and URA3. 

The vectors of the present invention preferably contain an element(s) 
that permits stable integration of the vector into the host cell genome or autonomous 
replication of the vector in the cell independent of the genome of the cell. For 
integration into the host cell genome, the vector may rely on the nucleic acid sequence 

20 encoding the polypeptide or any other element of the vector for stable integration of 
the vector into the genome by homologous or nonhomologous recombination. 
Alternatively, the vector may contain additional nucleic acid sequences for directing 
integration by homologous recombination into the genome of the host cell. The 
additional nucleic acid sequences enable the vector to be integrated into the host cell 

25 genome at a precise location(s) in the chromosome(s)._ To increasejhe likejihood of 

integration at a precise location, the integrational elements should preferably contain a 
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sufficient number of nucleic acids, such as 100 to 1,500 base pairs, preferably 400 to 
1,500 base pairs, and most preferably 800 to 1,500 base pairs, which are highly 
homologous with the corresponding target sequence to enhance the probability of 
homologous recombination. The integrational elements may be any sequence that is 
5 homologous with the target sequence in the genome of the host cell. Furthermore, 
the integrational elements may be non-encoding or encoding nucleic acid sequences. 
On the other hand, the vector may be integrated into the genome of the host cell by 
non-homologous recombination. 

For autonomous replication, the vector may further comprise an origin of 

10 replication enabling the vector to replicate autonomously in the host cell in question. 
Examples of origins of replication for use in a yeast host cell are the two micron origins 
of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and the combination 
of ARS4 and CEN6. The origin of replication may be one having a mutation which 
makes its functioning temperature-sensitive in the host cell (see, e.g., Ehrlich, 1978, 

15 Proceedings of the National Academy of Sciences USA 75: 1433). 

More than one copy of a nucleic acid sequence of the present invention may 
be inserted into the host cell to increase production of the gene product. An increase 
in the copy number of the nucleic acid sequence can be obtained by integrating at 
least one additional copy of the sequence into the host cell cjenome or by including an 

20 amplifiable selectable marker gene with the nucleic acid sequence where cells 
containing amplified copies of the selectable marker gene, and thereby additional 
copies of the nucleic acid sequence, can be selected for by cultivating the cells in the 
presence of the appropriate selectable agent. 

The procedures used to ligate the elements described above to construct the 

25 recombinant expression vectors of the presen t in vention are well known to one skilled 

in the art (see, e.g., Sambrook et a/., 1989, supra). 
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The present invention also relates to recombinant host cells, 
comprising a nucleic acid sequence of the invention, which are advantageously used 
in the recombinant production of the polypeptides. A vector comprising a nucleic acid 
sequence of the present invention is introduced into a host cell so that the vector is 
5 maintained as a chromosomal integrant or as a self-replicating extra-chromosomal 
vector as described earlier. The term "host cell" encompasses any progeny of a parent 
cell that is not identical to the parent cell due to mutations that occur during replication. 
The choice of a host cell will to a large extent depend upon the gene encoding the 
polypeptide and its source. 

10 Fungal cells may be transformed by a process involving protoplast formation, 

transformation of the protoplasts, and regeneration of the cell wall in a manner known 
perse. Suitable procedures for transformation of Aspergillus host cells are described 
in EP 238 023 and Yelton et a/., 1984, Proceedings of the National Academy of 
Sciences USA 81: 1470-1474. Suitable methods for transforming Fusarium species 

15 . are described by Malardier et a/., 1989, Gene 78: 147-156 and WO 96/00787. Yeast 
may be transformed using the procedures described by Becker and Guarente, In 
Abelson, J.N. and Simon, M.I., editors, Guide to Yeast Genetics and Molecular Biology , 
Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; 
Ito et a/., 1983, Journal of Bacteriology 153: 163; and Hinnen et a/., 1 978, Proceedings 

20 of the National Academy of Sciences USA 75: 1 920. 

The present invention also relates to methods for producing a 
polypeptide comprising cultivating a host cell under conditions suitable for production 
of the polypeptide. In the production methods of the present invention, the cells are 
cultivated in a nutrient medium suitable for production of the polypeptide using 

25 methods known in the art. For example, the cell may be '..cultivated by shake flask 
cultivation, small-scale or large-scale fermentation (including continuous, batch, 


* 
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fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed 
in a suitable medium and under conditions allowing the polypeptide to be expressed 
and or iscUit-.: The cultivation takes place in a suitable nutrient medium comprising 
caroon am: nitrjgen sources and inorganic salts, using procedures known in the art. 
5 SuiUitm* m-du are available from commercial suppliers or may be prepared according 
to pLjWisrv;<i compositions (e.g., in catalogues of the American Type Culture 
Collection. I? tne polypeptide is secreted into the nutrient medium, the polypeptide 
can be r»--;>v<rr. -j directly from the medium. If the polypeptide is not secreted, it can 
be reco\t:!>*o from cell Ivsates. 
10 Tnv polypeptides may be detected using methods known in the art that are 

specific icr tru: polypeptides. These detection methods may include use of specific 
antibodies, formation of an enzyme product, or disappearance of an enzyme substrate. 
For example, an enzyme assay may be used to determine the activity of the 
polypeptide as described herein. 

The resulting polypeptide may be recovered by methods known in the art. 
For example, the polypeptide may be recovered from the nutrient medium by 
conventional procedures including, but not limited to, centrifugation, filtration, 
extraction, spray-drying, evaporation, or precipitation. 

20 The polypeptides may be purified by a variety of procedures known in the art 

including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, 
chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative 
isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), 
SDS-PAGE, or extraction (see, e.g., Protein Purification, J.-C. Janson and Lars Ryden, 

25 editors, VCH Publishe/s.,^ 


» 
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Detailed description of the invention 

A Pichia cell comprising at least one copy of a heterologous nucleotide 
sequence encoding a polypeptide of interest, wherein the codon usage of said sequence 
has been adjusted to match the preferred codon usage of P. methanolica, as defined 
5 herein. 

The specific Pichia cell may be any Pichia cell, such as a Pichia methanolica, 
Pichia pastoris, Pichia anomala, Pichia stipitis, Pichia angusta or Pichia guilliermondii cell. 
Preferably the specific Pichia cell is a Pichia methanolica cell. 

A preferred embodiment of the invention relates to a Pichia cell of the first aspect 
10 of the invention, wherein the nucleotide sequence encodes an enzyme, preferably the 
nucleotide sequence encodes an enzyme having phytase or xylanase activity, or even 
more preferably the nucleotide sequence comprises a DNA sequence at least 90% 
identical, or preferably at least 93% identical, or more preferably at least 95% identical, 
or most preferably at least 97% identical to the sequences shown in SEQ ID No. 3, SEQ 
15 ID No. 19, or to the phytase encoding sequence comprised in a cell of DSM 13352. 

As is discussed elsewhere herein, there are a number of ways to introduce a 
heterologous DNA sequence into a host cell known in the art. 

A further embodiment relates to a Pichia cell of the first aspect, wherein the 
nucleotide sequence is present on an extrachromosomal DNA construct, preferably a 
20 plasmid, or more preferably the nucleotide sequence is integrated in the cell's 
chromosome, preferably in multiple copies. 

Also well known in the art are ways of expressing heterologous DNA sequences 
encoding polypeptides of interest in host cells. Several promoters have been described 
to work well in Pichia cells as described above. 

25 ... Accordingly a preferred embodiment of .the. invention, relates. \o_a. Pichia cell of 

the first aspect, wherein the nucleotide sequence is transcribed from a promoter of a 
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methanol-inducible P. methanolica gene, preferably the promoter is a native promoter of 
a P methanolica gene encoding an enzyme selected from the group consisting of 
alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase. 

For industrial production of polypeptides it is of considerable interest to have the 
5 polypeptides secreted from the producing host cells in order to minimize costs of 
isolating the polypeptide, ways of promoting efficient secretion are well known in the art. 
A polypeptide may be efficiently secreted if it carries an effective secretion signal peptide 
recognized by the host cell's secretion apparatus. A number of secretion signal peptides 
have been described to work well in yeast host cells as described elsewhere herein. The 

10 present inventors found that even without the addition of a well-characterized yeast 
secretion signal peptide, efficient secretion could be achieved from a host cell of a 
polypeptide carrying its native original secretion signal sequence. 

A preferred embodiment of the invention relates to a Pichia cell of the first aspect, 
wherein the polypeptide comprises its native original secretion signal sequence. 

15 The present invention relies on adjusting the codon usage of an isolated DNA 

sequence to match the preferred codon usage of Pichia methanolica. An isolated DNA 
construct comprising at least one copy of a nucleotide sequence encoding a 
polypeptide heterologous to Pichia methanolica, wherein the codon usage of said 
sequence has been adjusted to match the preferred codon usage of R methanolica, 

20 as defined herein. 

A preferred embodiment of the invention relates to an isolated DNA construct of 
the second aspect of the invention, wherein the nucleotide sequence encodes an 
enzyme, preferably the nucleotide sequence encodes an enzyme having phytase or 
xylanase activity, and more preferably the nucleotide sequence comprises a DNA 

25 seq uence at least 90% identical, or preferably at least 93% identical, or more ^ preferably 
at least 95% identical, or most preferably at least 97% identical to the sequences shown 


I 
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in SEQ ID No. 3, SEQ ID No. 19, or to the phytase encoding sequence comprised in a 
cell of DSM 13352. 

Another preferred embodiment of the invention relates to an isolated DNA 
construct of the second aspect of the invention, wherein the nucleotide sequence is 
5 transcribed from a promoter of a methanol-inducible R methanolica gene, preferably the 
promoter is a native promoter of a R methanolica gene encoding an enzyme selected 
from the group consisting of alcohol oxidase, dihydroxyacetone synthase, formate 
dehydrogenase, and catalase. Other promoters generally recognized as working well in 
Pichia cells are those of the glycolytic enzymes, such as glyceraldehyde-3-phosphate 
10 dehydrogenase, tnose phosphate isomerase, enolase etc. 

Further a preferred embodiment of the invention relates to an isolated DNA 
construct of the second aspect of the invention, wherein the nucleotide sequence 
encodes a polypeptide comprising its native original secretion signal sequence. 

As mentioned previously the invention relates to industrial production, i.e. a 
15 method of producing a polypeptide of interest in a Pichia cell, where the polypeptide is 
encoded by a nucleotide sequence heterologous to R methanolica, said method 
comprising the steps of: 

a) adjusting the codon usage of the sequence to match the preferred codon usage of R 
methanolica, as defined herein; and 
20 b) cultivating a Pichia cell comprising at least one copy of the codon usage adjusted 
sequence of step a) under appropriate growth conditions to express the sequence 
and achieve production of the polypeptide of interest. 

A preferred embodiment of the invention relates to a method of the third aspect, 
wherein the Pichia cell is a Pichia methanolica cell. 

25 Yet_another preferred„embodiment of the invention relates to a method of the 

third aspect, wherein the nucleotide sequence encodes an enzyme, preferably the 
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nucleotide sequence encodes an enzyme having phytase or xylanase activity, or more 
preferably the nucleotide sequence comprises a DNA sequence at least 90% identical, 
or preferably at least 93% identical, or more preferably at least 95% identical, or most 
preferably at least 97% identical to the sequences shown in SEQ ID No. 3, SEQ ID No. 
5 19, or to the phytase encoding sequence comprised in a cell of DSM 13352. 

The present inventors succeeded in obtaining much improved yields of 
polypeptides of interest by using a method of the third aspect of the invention, in one 
non-limiting example they even obtained a yield that exceeded 10 times the yield 

r 

■ obtained with the conventional method. 

]0 Consequently a preferred embodiment relates to a method of the third aspect, 

wherein the polypeptide is produced in a yield at least three times higher than what is 
obtained from an otherwise identical cell comprising a sequence not adjusted to the 
preferred codon usage of P. methanolica, as defined herein, preferably at least five 
times higher, more preferably seven, even more preferably eight, yet more preferably 

15 nine, and most preferably ten times higher than what is obtained from an otherwise 
identical cell comprising a sequence not adjusted to the preferred codon usage of R 
methanolica, as defined herein. 

in parallel with the two first aspects, preferred embodiments relate to methods of 
the third aspect, wherein the nucleotide sequence is transcribed from a promoter of a 

20 methanol-inducible R methanolica gene, preferably the promoter is a native promoter of 
a R methanolica gene encoding an enzyme selected from the group consisting of 
alcohol oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase; and 
preferably methanol is added to the cell during the cultivating step, whereby the 
methanol-inducible promoter is induced; it is also preferred that the nucleotide sequence 

25 encodes a polypeptide comprising its native original secretion sign al se quence. 
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Material and methods 

Purchased material (E.coli, plasmid and kits) 

E.coii DH12S (Gibco BRL) and E.coli JM109 (Toyobo) were used for plasmid 
construction and amplification. Amplified plasmid was recovered with Qiagen*' Plasmid 
5 Kit (Qiagen). Ligation was done with DNA ligation kit (Takara) or T4 DNA ligase 
(Boehringer Mannheim). Polymerase Chain Reaction (PCR) was carried out either with 
Taq DNA polymerase (Boehringer Mannheim) or with Expand ™ PCR system especially 
Expand™ Long Template (Boehringer Mannheim). QIAquick™ Get Extraction Kit 
(Qiagen) was used for the purification of PCR fragments and extraction of DNA fragment 
10 from agarose gel. A plasmid pT7blue T-vector (Novagen) was used for the cloning of 
PCR fragments. 

Expression host strain 

The expression host strain is a Pichia methanolica PMAD16 (ade2- pep4AprblA) 
1 5 described in WO 99/14347. It is a vacuolar protease deficient mutant originally generated 
from P. methanolica CBS 6515. Two major vacuolar proteinases, i.e. Proteinase A 
encoded by the pep4 gene and Proteinase B encoded by the prbl gene, are inactivated 
by disrupting these genes (WO 99/14347). 

20 Expression plasmid 

The expression cassette plasmid pCZR134 is described in WO 99/14347. It comprises 
the promoter and terminator of the alcohol oxidase gene AUG1 from P methanolica (WO 
99/14347) and the ADE2 gene encoding phosphoribosyl-5-aminoimidazole carboxylase 
(WO 99/14347) from P. methanolica. 

25 Another, expression^ cassette .plasmid is pGAP, which Js ..identical to pCZ R134 b ut 

contains the promoter of the gene encoding the glyceraldehyde-3-phophate 
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dehydrogenase GAP1 from P. methanolica (described in WO00/56903. SEQ ID No. 8 
therein) instead of the AUG1 promoter as described above. 

Transformation of P methanolica 
5 Transformation of Pichia methanolica can be done using the general methods for yeast 
transformation. As the most preferred method, the procedure of electro transformation is 
described below. The P methanolica host strain is grown in non-selective YPD medium 
(20g/l of Pepton, Difco, 1 0g/I of yeast extract, Difco and 20g/l of glucose) at 37°C for 
overnight with vigorous aeration, and the culture is diluted 100-fold in 200ml fresh, 

10 pre-warmed YPD. This is further cultivated at 37°C until the optical density at 663nm 
becomes 1.2-1.5, which corresponds to 9x1 0 7 cells/ml. Cells are collected by 
centrifugation at 3000 g for 10 minutes, and re-suspended in 40ml of 50m M potassium 
phosphate buffer pH 7.5 containing 25mM of dithiothreitol (DTT), then incubated at 37°C 
for 15 minutes. The cells are washed twice with ice-cold electroporation buffer STM 

15 (270mM sucrose, 10mM Tris pH7.5 and ImM CaCI2), at first with 200 ml (the original 
culture volume), then with 100 ml (50% original volume). Finally the cells are 
re-suspended in 1 ml (0.5% of original volume) ice-cold STM buffer to a concentration of 
approximately 2x1 0 10 electrocompetent cells/ml. 

0.1-10 M9 of DNA (volume < 10[jl) is added to 100pl of electrocompetent cells 

20 and placed into an ice-cold electroporation cuvette with an electrode gap of 2mm. A 
pulsed electric field is provided at 3.75 kV/cm for 20 millisecond, using a BioRad Gene 
Pulser™ electroporator on which resistance is set to >600 Q or "infitine" and capacitance 
is set to 25 pF. After delivering the pulse, 1ml of YPD is added to the cell/DNA mixture 
and incubated for 1 hour at 30°C. 

25 Cells are harvested by centrifugation at 3000 g for 5 minutes, then washed with 

1ml of yeast nitrogen base (6.7 g/l yeast nitrogen base without amino acids, Difco), and 
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plated on minimal selective media, e.g. SC glucose. Host cells having an ade2 mutation 
appear as pink colonies on minimal medium, whereas transformants that acquired a 
stable integration (Ade+) can grow rapidly as white colonies on minimal medium. 

5 Example 1 

A cDNA clone of the Thermomyces xylanase gene 

The cDNA sequence of the Thermomyces lanuginosus DSM4109 xylanase 
gene was published (WO 96/32472). For this study we obtained A Saccharomyces 
cerevisiae strain, DSM 10133, containing a plasmid comprising the full-length 
1 0 xylanase cDNA of the Thermomyces lanuginosus DSM4109. 

Construction of expression plasmid pCaXH 

The Thermomyces xylanase cDNA was amplified by PCR using primers xyN 
and xyC to introduce desirable restriction enzyme cleavage sites at both ends of the 
15 PCR fragment: 

xyN (SEQ ID 1 ): CCGGAATTCATGGTCGGCTTTACCCCCGTT. 
XyC (SEQ ID 2): AAGGAAAAAAGCGGCCGCACTAGTTTAGCCCA 

CGTCAGCAAC. 

20 The PCR reaction mixture comprised 50 ng of template cDNA, 0.05mM of 

dNTPs, 100 pmol of each primer and 1unit of Taq polymerase in 5Qpl of Taq buffer with 
MgCI2 (Boehringer Mannheim). The reaction was initiated at 94°C for 3.5 minutes 
followed by 30 cycles of 94°C for 1.5 minutes, 50°C for 1.5 minutes and 72°C for 3 
minutes, with the final extension at 72°C for 10 minutes. The amplified 0.7kb fragment 

t2 was digested with EcpRI and Spel, and purjfiedjpn J. 5% agarose gel : _Obtained„fragment. 

was subjected to the ligation with pCZR134 linearized by EcoRI and Spel. The resulting 
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Gene synthesis of Themomyces xylanase gene 

The Themomyces xylanase gene was synthesized to change the codon usage 
to have A or T as the preferred third letter of each codon. This is not a full optimization to 
the preferred codon usage of Pichia methanolica, as defined herein, but rather a first 
attempt to see if there is any advantage in optimizing the codons further. Two fragments 
XyC12 and XyC37 were synthesized by two-step PCR. The resulting designed xylanase 
encoding sequence is shown as SEQ ID 3. 

Oligonucleotides used for the first PCR reaction were as follows: 
For the synthesis of XyC12: 

xyC1 (SEQ ID 4): atggttggttttactccagttgctttagctgctttagctgctactggtgctttagctttc 
ccagcaggtaatgctactgaattagaaaaaagacaaacaactccaaattctgaaggttgg. 

xyC2r(SEQ ID 5): cataggtaccaccttctaaattagtataagtagcttgagcaccaccatca 
gaccaccaagaataataataaccatcatgccaaccttcagaatttggaga. 

For the synthesis of XyC37: ~ 

xyC3 (SEQ ID 6): ggtacctatgaaatttcttggggtgatggtggtaatttagttggtggtaaaggttggaat 

ccaggtttaaatgcaagagctattcattttgaaggtgtttatcaaccaaatggtaattct. 
xyC4r(SEQ ID 7): ctagtagttttacctaatctataaatagaaccatcacattcaacagtacctaaatcag 

tagcaccagaagatggatcataagtaccaaaattttcaacaatataatattcaactaatgga. 
xyC5 (SEQ ID 8): ttctggtgctactgatttaggtactgttgaatgtgatggttctatttatagattaggtaa 

aactactagagttaatgcaccatctattgatggtactcaaactttcgatcaatattggtc. 
xyC6r (SEQ JD 9): taataatgat caccat taacattcaaaccagctctagcccaagcatcgaaatgac 

aaccagtttgaacagtaccagaagttcttttatcttgtctaacagaccaatattgatcgaaagtt. 
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xyC7 (SEQ ID 10): ttaatggtgatcattattatcaaattgttgcaactgaaggttatttctcttctggttatg 
eta g a a tta ctg ttg ct g a t g tt g g tta a . 

Oligonucleotides used for the second PCR reaction were as follows: 
For the synthesis of XyC 12: 

xyc12N (SEQ ID 11): ataagaatgcggccgcatggttggttttactccagt. 
xyc12C (SEQ ID 12): tcataggtaccaccttctaaattagtataa. 

For the synthesis of XyC37: 

xyc37N (SEQ ID 13): ttagaaggtggtacctatgaaatttcttggggtgat. 
xyc37C (SEQ ID 14): ttttccttttgcggccgcttaaccaacatcagcaac. 

The first PCR reaction mixture contained 1pmol of each oligonucleotide, 0.35mM 
dNTP, 2.6 units of Expand™ polymerase in 5p|jl of provided buffer. Reaction started at 
94°C for 1 minute followed by 6 cycles of 94°C for 0.5 minutes, 50°C for 5 minutes and 
72°C for 1 minute with the final extension at 72°C for 10 minutes. Five pi of resulting first 
PCR reaction solution was used for the second PCR with 0.35mM dNTP, 100 pmol of 
each oligonucleotide and 2.6 units of Expand™ polymerase in 5Qpl of provided buffer. 
The reaction was carried out at 94°C for 3.5 minutes first followed by 30 cycles of 94°C 
for 1.5 minutes, 50°C for 1:5 minutes and 72°C for 3 minutes with the final extension at 
72°C for 10 minutes. The amplified fragments were purified on 4% agarose gel then 
ligated into pT7b!ue T-vector resulting pTXyC12-7 and pTXyC37-715 respectively. They 
were confirmed to have no PCR error by sequencing. A Kpnl fragment of around 200bp 
was isolated from pTXyC12-7 then ligated with a 3.7kb of Kpnl fragment from 
_. pTXyC37_-7 1 5. The resulting^ plasmid pTXyC17-1 comprises the sequence of , the 
codon-optimized synthesized Themomyces xylanase gene (SEQ ID 3). 
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Construction of expression plasmid pCaXS3 

The synthesized Thermomyces xylanase gene was amplified by PCR using the 

following primers; 

5 

a) (SEQ ID 15): CCGGAATTCATGGTTGGTTTTACTCCAGTTGCT. 

b) (SEQ ID 16): GGACTAGTTTAACCAACATCAGCAACAGTAAT. 

The reaction mixture comprised 10 ng of synthesized xylanase gene, 0.35 mM 
0 of dNTP, 100 pmol of each primer and 2.6 units of Expand™ polymerase in SOpI of 
provided buffer with MgCI2. The PCR reaction was initiated at 94°C for 3.5 minutes 
followed by 30 cycles of 94°C for 1.5 minutes, 55°C for 1.5 minutes and 68°C for 3 
minutes with the final extension at 72°C for 10 minutes. The amplified gene was digested 
with EcoRI and Spel, and ligated into pCZR134 cut by EcoRI and Spel beforehand and 
5 the resulting plasmid was termed pCaXS3. 

Transformation of R methanolica with pCaXM and pCaXS3 

The expression plasmids pCaXM and pCaXS3 were digested with Not! and 

6.2kb fragments were used for the transformation of R methanolica PMAD16 to achieve 
0 a Ade+ phenotype. Transformed ceils were grown on SC glucose (2% glucose, 0.01% 

tryptophan, 0.02% threonine, 0.5% casamino acid, in basal salt without amino acids (per 

litre, 6.68g yeast nitrogen base w/o amino acid (Difco), 10g succinic acid, 6g NaOH, and 

2% Agar Noble (Difco) containing 1 .2Msorbitol). Rapidly growing white colonies were 

isolated as stable transformants. 
:5_ ^Obtained .transformants were grown o.n_YPM agar (20g/l of Pepton, Difco, 10g/l 

of yeast extract, Difco, 20g/l of Agar Noble, Difco and 20ml/i of methanol) containing 
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0.1% AZCL™-xylan (Megazyme™) for 3-5 days at 30°C. Positive transformants were 
identified as the colonies surrounded by a blue halo. 

Xylanase producing cells were inoculated in YPD medium (20g/l of Pepton, 
Difco, 1 0g/I of yeast extract, Difco and 20g/l of glucose) in a 500 ml shaking flask, and 
cultivated under vigorous aeration overnight at 30°C. One ml of methanol was added to 
each shaking flask and it was cultivated one more day Cells were removed by 
centhfugation and xylanase activity in the supernatant was measured using 0.5% 
Azo-Wheat Arabinoxylan (Megazyme™) as substrate. The measurement reaction was 
employed at 30°C, pH 6 for 30 minutes and released colour was measured as the 
absorbance at 585nm. The amount of enzyme protein was estimated from the activity. 
The results are shown below; our conclusion to this first attempt is that the codon 
optimization did indeed have a significant positive effect on the obtainable yield from a 
Pichia cell. However, since we did not fully optimize the codons of the xylanase encoding 
sequence to match the preferred codon usage of R methanolica, it may be possible to 
obtain even greater yields by adjusting the sequence thoroughly. 

Strain Plasmid Gene : Yield in shaking flask 

PXI242 pCaXM intact gene 12mg/l 

PXS5 pCaXS3 synthesized gene 20 mg/l 

A transformant carrying the synthesized Thermomyces xylanase gene was 
inoculated in two flasks with 110ml of YEPD (10g/l Bacto yeast extract, 20g/l Bacto 
peptone and 20g/l glucose) in 500ml shaking flasks then cultivated under vigorous 
aeration condition at 30°C for 16-18 hours. Most of the cells were used for the 
subsequent inoculation of the fermentor. The fermentation vessel was filled with 2 liters 
of media containing. 43g (NH4)2S04, 5.2g K2HP04,J9g KH2PO4,„0.8g FeS04.7H20, 
2.3g sodium citrate with 2aq. After autoclaving and cooling down to 30°C, 94ml of 60% 
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glucose, 40ml of 1M MgS04.7H20, 4ml of 1M CaCI2, 40ml of trace metal solution 
(containing, per litter, 27.8g Fe S04.7H20, 0.5g CuS04.5H20, 1.09g ZnCI2, 1.35g 
MnS04.H20. 0.48g COCI2.6H20, 0.24g Na2Mo04.2 H20, 0.5g H3B03, 0.08g Kl, 0.26g 
NiS04.6 H20, 0.56g thiamine hydrochloride, 5mg biotin, 1ml H2S04) and 4rnl of vitamin 
5 solution (containing,' per litter, 47g inositol, 25g calcium pantothenate, 1.3g pyridoxine 
hydrochloride, 5.6g thiamine hydrochloride, 0.1g biotin) were added. The pH of the 
fermentation media was adjusted to 5.0 and during the fermentation pH was controlled 
automatically with 25% ammonia water and 10% H3PO4. Aeration was provided as 
compressed air at a flow rate of 3 litters per minute initially and agitation at the rate of 

10 100 rphn. After inoculation the agitation was controlled automatically from 100 rpm to 
1000 rpm to keep the dissolved oxygen level at minimum 30% of saturation. Following 
exhaustion of glucose after inoculation, a glucose feed was initiated at 12 hours at the 
rate of 0.27% glucose/hour. The feed was increased stepwise to 0.38% glucose/hour at 
18 hours and to 0.5% glucose/hour at 30 hours. After 40 hours, methanol induction of the 

15 AUG1 promoter was started by methanol feed at the rate of 0.3% methanol/hour, in 
parallel the glucose feed rate was reduced to 0.38% glucose/hour. Finally mixed feeding 
(0.6% methanol/hour, 0.27% glucose/hour) was run for the rest of fermentation. 
Increasing methanol feeding elevated the enzyme productivity. The yield of 
Thermomyces xylanase was calculated to be 750 mg/l with the 110g/kg of dried cells at 

20 170 hours. 

Example 2 

cDNA clone of A. fumigatus phytase gene 

The sequence of the phytase gene from Aspergillus fumigatus ATCC 13073 was 

25 published prevjpusJy,(EP^08970 10). This phytase contains an amino acid mutation Q27L, 

and the phytase of the invention comprises a further mutation at the second amino acid, 
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V2I, due to the introduction of a restriction enzyme cleavage site around the translation 
initiation codon. 


Construction of expression plasmid pCaPhy3 

A cDNA of A fumigatus phytase gene was amplified by PCR using the primers 
shown below; 


A1) (SEQ ID 17): CCGGAATTCATGATTACTCTGACTTTCCTGCTTTCGGCGG 
CGTATCTGCTTTCTGGTAGAGTGTCTGCGGCACCTAGTTCTGCTGGCTC. 
] 0 B1 ) (SEQ ID 18): CTAGACTAGTCTAGTCAACTAAAGCACTCTCCCCA. 


The reaction mixture comprising 10 ng of cDNA clone of A.fumigatus phytase, 
0.35 mM dNTPs, 100 pmol of each primers and 2.6 units of Expand™ polymerase in 
50pl of provided buffer with MgG2, was incubated at 94°C for 2 minutes followed by 10 

15 cycles of 94°C for 10 seconds, 55°C for 30 seconds, 68°C for 45 seconds. After that 
another 20 cycles were performed as 94°C for 10 seconds, 55°C for 30 seconds, 68°C 
for 45 plus n x 20 seconds (n: cycle number). 

The final extension was at 68°C for 7 minutes. Amplified 1 .3kb fragment was 
purified on 1.2% agarose gel and digestion with EcoRI and Spel. It was used for the 

20 ligation with pCZR134 digested with the same restriction enzymes and the obtained 
plasmid was termed pCaPhy3. 


Construction of the expression plasmid pCPS40 

DNA encoding the A. fumigatus phytase was synthesized, the sequence was 
25 — designed based-on the codon usage of two genes from Pichia methanolica, i.e. ADE.1._ 
and AUG1 . Codon usage of designed sequence is shown in table 1 . 
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This designed codon optimized phytase DNA sequence is shown in (SEQ ID 19). 


Table i 


Ammo 

Codon 

P.methanoli 

P.methanoli 

Original 

Optimized j 

acid 


ca ADE1 

ca AUG1 

phytase 
gene 

phytase 
gene 

F 

1 1 1 

9 

10 

12 

12 


TTC 

6 

19 

15 

15 

i 

^_ 

TTA 

17 

16 

0 

31 


TTG 

14 

19 

10 

10 


CTT 

1 

2 

7 

0 


CTC 

2 

0 

5 

0 


CTA 

7 

3 

4 

4 i 


CTG 

2 

0 

19 

0 

I 

— 

ATT 

19 

22 

8 

9 


ATC 

18 

12 

6 

6 


ATA 

5 

0 

1 

0 

M 

ATG 

17 

17 

6 

6 

V 

GTT 

29 

36 

6 

25 


GTC 

12 

4 

4 

4 


GTA 

0 

0 

3 

0 


GTG 

3 

0 

16 

0 

s 

TCT 

10 

20 

6 

15 


TCC 

5 

10 

10 

17 


TCA 

9 

11 

3 

16 


TCG 

4 

0 

13 

0 


AGT 

2 / 

0 

9 

0 


AGC 

3 

0 

7 

0 


CCT 

7 

1 

7 

7 


CCC 

1 

0 

4 

0 


■ 

CCA 

19 

44 

4 

15 


CCG 

5 

0 

7 

0 

T 

ACT 

22 

31 

11 

21 


ACC 

6 

11 

9 

9 


ACA 

4 

2 

1 

1 


ACG 

2 

0 

10 

0 

A 

GCT 

27 

40 

9 

25 


GCC 

14 

7 

12 

18 


GCA 

7 

1 

6 

0 


GCG 

3 

0 

16 

0 i 

Y 

TAT 

5 

9 

4 

4 


TAC 

9 

20 

14 

14 

H 

CAT 

8 

4 

4 

4 


CAC 

7 

18 

4 

4 

_ -_Q__ 

CAA 

18 

12 

2 

15 


CAG 

2 

0 

13 

0 

i N 

AAT 

11 

2 

6 

6 i 

i 


f 
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AAC 

13 

30 

! 13 

19 

K 

AAA 

i 15 

7 

i 1 

1 
■ 


AAG 

i 17 

31 

! 25 

25 

D 

GAT 

15 

30 

8 

8 


GAC 

13 

18 

17 

17 

i * 

E 

GAA 

20 

33 

4 

20 


GAG 

7 

3 

16 

0 

c 

TGT 

9 

1 1 

2 

10 


TGC 

1 
■ 

o 

ft 

o 

w 

TGG 

2 

10 


\J 

R 

CGT 

2 

1 


1 

1 


CGC 

o 

n 

7 

n 


CGA 

o 

n 

o 

n 


CGG 

0 

n 


0 


AGA 

14 

31 

2 

18 


AGG 

r 2 

n 

1 
i 

1 

G 

GGT 

26 

JO 


33 


GGC 

11 

1 

16 

o 


GGA 

3 

0 

6 

0 


GGG 

2 

0 

8 

0 

Stop 

TAA 

0 

0 

0 

0 


TAG 

0 

1 

0 

0 


TGA 

1 

0 

1 

1 

Total 


544 

665 

466 

466 


Seventy oligonucleotides (SEQ ID's 20-89) were prepared for the construction of 
the codon optimized phytase gene, and subjected to PCR using Expand ™ Long 
Template (Boehringer Mannheim) for assembly. The reaction mixture comprised 125 
5 pmoi of each of the seventy oligonucleotides shown in SEQ ID'S 20-89, 0.2 mM dNTPs 
and 2.6 units of Expand ™ polymerase in 50pl of provided buffer. The reaction was 
employed as 55 cycles of 94°C for 30 seconds, 52°C for 30 seconds and 68°C for 30 
seconds. The second PCR was carried out using the resulting first reaction mixture as 
template and the following set of primers: 


c) (SEQ ID 90): CGGAATTCATGATTACTTTAACTTTCTTATTATCAGCTTATTTA. 

d) (SEQ ID 91 ): GGACTAGTTCAAGAAAAACATTCACCCC. 


WO 01/66693 PCT/DK01/00154 

30 

The reaction mixtures comprised 0.5pl, 1 pi or 2pl of the first PCR mixture, 30 
pmot of each primer, 0.2 mM dNTPs and 2.6 units of Expand ™ polymerase in 50pl of 
provided buffer. The reaction was employed as 25 cycles of 94°C for 30 seconds, 50°C 
for 30 seconds and 68°C for 60 + 10 x n seconds (n: cycle number). An amplified 1.3kb 
5 fragment was purified with QIAquick and cloned into pT7blue T-vector. Two independent 
plasmids pTPhyS and pTPhy8 were selected for further construction, they had a couple 
of PCR generated sequence errors within the Hindlll-Spel region of pTPhyS and the 
EcoRI-Hindlll region of pTPhy8. 

Each piasmid was digested with Hindi!!, and a 3.4kg fragment from pTPhyS and 
I0 a 0.7kb fragment from pTPhy8 were isolated and ligated together, resulting in the 
piasmid pTPhy58, which comprises the entire sequence of the synthetic A fumigatus 
phytase gene. 

The piasmid pTPhy58 was digested with EcoRI and Spel, and a 1 ,3kb fragment 
was isolated and ligated ipto the expression vector pCZR134. The resulting phytase 
15 expression piasmid was designated pCPS40. 

The entire sequence of the synthetic codon optimized A. fumigatus phytase 
gene (SEQ ID No. 19) is comprised in the strain E. coli NN049526, which was deposited 
with DSMZ 01-March-2000 with the accession number DSM 13352. The deposit was 
made by Novo Nordisk A/S and was later assigned to Novozym.es A/S. 

20 

Transformation of P. methanolica with pCaPhy3 and pCPS40 

The expression plasmids pCaPhy3 and pCPS40 were digested with Notl and 
6.7kb fragments were used for the transformation of R methanolica PMAD16. 
Transformed cells were grown on minimal medium SC glucose containing 1.2 M sorbitol. 
.25 _ .White colonies growing, rapidlyjwerejso^ 

transformants were subsequently grown on YPM agar containing 0.1% phytic acid 


.v»f 


4 


t 
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(Sigma P8810) at 30°C overnight, then 0.1 M CaCI2 was poured onto the grown agar 
plate to detect phytase production. Positive transformants were detected as colonies 
surrounded by a clear halo. 

Phytase producing strains were inoculated in 50ml of YPD medium in 500ml of 
5 shaking flasks and cultivated at 30°C with vigorous aeration. After overnight cultivation, 
1ml of methanol was added to each shaking flask and the cultivation was continued one 
more day. Cells were removed by centrifugation and phytase activity in the supernatant 
was measured using 0.15% sodium phytate in 100mM sodium acetate buffer pH 5.5 as 
the substrate. The reaction was carried out at room temperature for 15 minutes, and 
10 liberated phosphate ions were quantified by colorization with 1% ammonum molybdate, 
0.26M FeS04 in 3.2% H2S04 and absorbance was measured at 650nm. The amount of 
enzyme protein was calculated from the activity based on the known specific activity per 
mg enzyme protein. The results are shown in Table 2. 

15 Table 2. 


Strain 

Piasmid 

Used qene 

Yield in shakinq flask 

PPi51 

pCaPhy3 

cDNA 

1 mg/l 

PPs237 

DCPS40 

codon optimized 

20 mq/l 


20 The transformants mentioned above were cultivated as described in example 1. 

The yield of A fumigatus phytase from a transforrnant generated with pCaPhy3, carrying 
non codon optimized cDNA of the phytase gene, was calculated to be 310 mg/l with 120 
g/kg of dried cells at 170 hours of fermentation, whereas the yield from the transforrnant 
with a codon-optimised gene was estimated to be 3.5 g/l with 115 g/kg of dried cells at 

25 1 90 hours of fermentation. 

Thus the yield from the codon optimized transforrnant was approximately 10 
times higher than what could be obtained from the transforrnant carrying the original 


A 


t 
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cDNA phytase gene. 
Example 3 

Expression of the A. fumigatus phytase from the GAP-promoter 
5 A synthesized A.fumigatus phytase gene was obtained from pCPS40 as described in 
Example 2 by digesting with the restriction enzymes EcoRI and Spel. The isolated 
1 .3kb fragment was ligated into the pGAP vector, which was initially pre-digested with 
the same enzymes, the resulting plasmid was termed pGAPphy. The plasmid 
pGAPphy was then digested with Notl and the 6.9kb fragment was used for the 
10 transformation of Rmethanolica PMAD16. Transformants were isolated as described 
in Example 2. 

Obtained transformants were cultivated in 5ml of YPD medium at 30°C overnight with 
vigorous aeration, then 1ml of grown culture was transferred to fresh 100ml of YPD 
medium in a shaking flask. After one-day cultivation at 30°C, 2ml of 20% glucose 
15 solution was added to each flask and cultivation was continued for one more day. 
Enzyme activity of the culture broth, from which the grown cells were removed, was 
measured to estimate the enzyme productivity. The estimated phytase productivity 
was47mg/l. 

Atransformant carrying a synthesized Aspergillus fumigatus phytase gene was 
20 fermented in 5L jar-fermenter. The fermentation vessel was filled with 2 liters of medium 
containing 43g (NH4)2S04, 5.2g K2HP04, 19g KH2P04, 0.8g Fe SO4.7H20, 2.3g 
sodium citrate with 2aq. After autoclaving and cooling down to 30°C, 94ml of 60% 
glucose, 40ml of 1M MgS04.7H20, 4ml of 1M CaCI2, 40ml of trace metal solution and 
4ml of vitamin solution were added. The pH of fermentation media was adjusted to 5.0 
25 and during the fermentation it was controlled automatically above 5.0 with 25% ammonia 
water. Aeration was provided as compressed air.at a flow rate of 3 litres per minute 
initially and agitation at the rate of 100 rpm. After inoculation the agitation was controlled 
automatically from 100 rpm to 1000 rpm to keep the dissolved oxygen level at minimum 
30% of saturation. Around 11 hours after inoculation, 1100ml of 60% glucose was 
30 supplemented. The yield of Aspergillus fumigatus phytase at 50 hours was calculated to 
1000 mg/l with the 65 g/kg of dried cells. 
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CLAIMS 

1. A Pichia cell comprising at least one copy of a heterologous nucleotide sequence 
encoding a polypeptide of interest, wherein the codon usage of said sequence has been 
adjusted to match the preferred codon usage of R methanolica, as defined herein. 


25 


2. The cell of claim 1 , which is a Pichia methanolica cell 


3. The cell of claim 1 or 2, wherein the nucleotide sequence encodes an enzyme. 


10 4. The cell of claim 1-3, wherein the nucleotide sequence encodes an enzyme having 
phytase or xylanase activity. 


5. The cell of claim 1, wherein the nucleotide sequence comprises a DNA sequence at 
least 90% identical to the sequences shown in SEQ ID No. 3, SEQ ID No. 19 t or to the 
15 phytase encoding sequence comprised in a cell of DSM 13352. 


6. The cell of claims 1-5, wherein the nucleotide sequence is present on an 
extrachromosomal DNA construct, preferably a plasmid. 


20 7. The cell of claims 1 - 5, wherein the nucleotide sequence is integrated in the cell's 
chromosome. 


8. The cell of claims 1-7, wherein the nucleotide sequence is transcribed from a 
promoter of a methanol-inducible R methanolica gene. 

9. The cell of claim 8, wherein the promoter is a native promoter of a R methanolica 
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gene encoding an enzyme selected from the group consisting of alcohol oxidase, 
dihydroxyacetone synthase, formate dehydrogenase, and catalase. 

10. The cell of claims 1 - 9, wherein the polypeptide comprises its native original 
5 secretion signal sequence. 

11. An isolated DNA construct comprising at least one copy of a nucleotide sequence 
encoding a polypeptide heterologous to Pichia methanolica, wherein the codon usage of 
said sequence has been adjusted to match the preferred codon usage of P. methanolica, 

10 as defined herein. 

12. The DNA construct of claim 11, wherein the nucleotide sequence encodes an 
enzyme. 

15 13. The DNA construct of claim 11, wherein the nucleotide sequence encodes an 
enzyme having phytase or xylanase activity. 

14. The DNA construct of claim 11, wherein the nucleotide sequence comprises a DNA 
sequence at least 90% identical to the sequences shown in SEQ ID No. 3, SEQ ID No. 

20 19, or to the phytase encoding sequence comprised in a cell of DSM 13352. 

1 5. The DNA construct of claims 11-14, wherein the nucleotide sequence is transcribed 
from a promoter of a methanol-inducible R methanolica gene. 

25 1 6. The DNA construct of claim 1 5, wherein the ^promotejr^is a native promoter of _a_R 

methanolica gene encoding an enzyme selected from the group consisting of alcohol 
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oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase. 

17. The DNA construct of claims 11 — 16, wherein the nucleotide sequence encodes a 
polypeptide comprising its native original secretion signal sequence. 

5 

18. A method of producing a polypeptide of interest in a Pichia cell, where the 
polypeptide is encoded by a nucleotide sequence heterologous to P methanolica, said 
method comprising the steps of: 

a) adjusting the codon usage of the sequence to match the preferred codon usage of R 
10 methanotica, as defined herein; and 

b) cultivating a Pichia cell comprising at least one copy of the codon usage adjusted 
sequence of step a) under appropriate growth conditions to express the sequence 
and achieve production of the polypeptide of interest. 

15 19. The method of claim 18, wherein the Pichia cell is a Pichia methanolica cell. 

20. The method of claim 18 or 19, wherein the nucleotide sequence encodes an 
enzyme. 

20 21 . The method of claim 18 or 19, wherein the nucleotide sequence encodes an enzyme 
having phytase or xylanase activity. 

22. The method of claim 18 or 19, wherein the nucleotide sequence comprises a DNA 
sequence at least 90% identical to the sequences shown in SEQ ID No. 3, SEQ ID No. 
25 1 9, prjo the phytase.encpdjng^sequenpe^ in a cell. of DSM J.3352. 
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23. The method of claims 18-22, wherein the polypeptide is produced in a yield at 
least three times higher than what is obtained from an otherwise identical cell 
comprising a sequence not adjusted to the preferred codon usage of P. methanolica, 
as defined herein. (Preferably at least five times higher, preferably seven, more 

5 preferably eight, even more preferably nine, and most preferably ten times higher.) 

24. The method of claims 18-23, wherein the nucleotide sequence is transcribed from a 
promoter of a methanol-inducible P. methanolica gene. 

r 

10 25. The method of claim 24, wherein the promoter is a native promoter of a P 
methanolica gene encoding an enzyme selected from the group consisting of alcohol 
oxidase, dihydroxyacetone synthase, formate dehydrogenase, and catalase. 

26. The method of claim 24 or 25, wherein methanol is added to the cell during the 
15 cultivating step, whereby the methanol-inducible promoter is induced. 

27. The method of claims 18-26, wherein the nucleotide sequence encodes a 
polypeptide comprising its native original secretion signal sequence. 
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SEQUENCE LISTING 

< 1 1 0 > Novczyrr.es A/S 

< 1 2 0 > High yield production of polypeptides in Pichia 
methanol ica 

<130> 6125.200-U5 

<140> 
<141> 

<160> 91 

<170> Pacentln.Ver. 2.1 

<210> 1 
<2.11> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer xyN 
<400> 1 

ccggaattca tggtcggctt tacccccgtt 30 


<210> 2 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer XyC 
<400> 2 

aaggaaaaaa gcggccgcac tagtttagcc cacgtcagca ac 42 


<210> 3 
<211> 678 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Codon 
optimized xylanase encoding sequence . 

<400> 3 

atggttggtt ttactccagt tgctttagct gctttagctg ctactggtgc tttagctttc 60 
ccagcaggta atgctactga attagaaaaa agacaaacaa ctccaaattc tgaaggttgg 120 
catgatggtt attattattc ttggtggtct gatggtggtg ctcaagctac ttatactaat 180 
ttagaaggtg gtacctatga aatttcttgg ggtgatggtg gtaatttagt tggtggtaaa 24 0 
ggttggaatc caggtttaaa tgcaagagct attcattttg aaggtgttta tcaaccaaat 300 
ggtaattctt atttagctgt ttatggttgg actagaaatc cattagttga atattatatt 360 
gttgaaaatt— ttggtactta-tgatccatct- tctggtgcta ctgatttagg tactgttgaa -420- 
tgtgatggtt ctatttatag attaggtaaa actactagag ttaatgcacc atctattgat 480 
ggtactcaaa ctttcgatca atattggtct gttagacaag ataaaagaac ttctggtact 540 
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gttcaaactg grccccatt:: cgacgcccgg gctagagctg gtttgaatg:: ~aa tggtgat 500 
cattatcatc aaattgctgc- aactgaaggt tac:tct:c:t. ctggc t atgc cagaa:tact 650 
gttgctgatg ttggttaa 67 B 


<210> 4 
<211> 120 
<2I2> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Description of Artificial Seauence : Primer XvCl 
<400> 4 

atggttggt t ttactccagt tgctttagct gctttagctg ctactggtgc tttagctttc 60 
ccagcaggta atgctactga attagaaaaa agacaaacaa ctccaaattc tgaaggttgg 120 


<210> 5 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer XyC2r 
<400> 5 

cataggtacc accttctaaa ttagtataag tagcttgagc accaccatca gaccaccaag 60 
aataataata accatcatgc caaccttcag aatttggaga 100 


<210;> 6 
<211> 120 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer XyC3 
<400> 6 

ggtacctatg aaatttcttg gggtgatggt ggtaatttag ttggtggtaa aggttggaat 60 
ccaggtttaa atgcaagagc tattcatttt gaaggtgttt atcaaccaaa tggtaattct 12 0 


<210> 7 
<211> 120 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer XyC4r 
<400> 7 

ctagtagttt tacctaatct ataaatagaa ccatcacatt caacagtacc taaatcagta 60 
gcaccagaag atggatcata agtaccaaaa ttttcaacaa tataatattc aactaatgga 120 


<210> 8 
<211> 120 
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<212> DNA 

<213> Artificial Sequence 
<220> 

< 2 2 3 > Description of Artificial Sequence: Primer xyC5 
<400> 8 

ttctggtgct actgatttag gtactgttga atgtgatggt tctatttata gattaggtaa 60 
aactactaga gttaatgcac catctattga tggcactcaa act ttcgatc aatattggtc 120 


<210> 9 
<211> 120 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer xyC6r 
<400> 9 

taataatgat caccattaac attcaaacca gctctagccc aagcatcgaa atgacaacca 60 
gtttgaacag taccagaagt tcttttatct tgtctaacag accaatattg atcgaaagtt 120 


<210> 10 
<211> 89 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer xyC7 
<400> 10 

ttaatggtga tcattattat caaattgttg caactgaagg ttatttctct tctggttatg 60 
ctagaattac tgttgctgat gttggttaa 8 9 


11 
36 
DNA 

Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer xycl2N 
<400> 11 

.ataagaatgc ggccgcatgg ttggctttac tccagt 3 6 


<210> 
<211> 
<212> 
<213> 


<210> 12 
«=211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer xyc!2C 


<400> 12 

tcataggtac caccttctaa attagtataa 30 


r 


f 


WO 01/66693 


PCT/DK0 1/001 54 


4 



: : A: . . f icial Sequence: Primer xyc37N 


<40C> 


tnacaac : 


\ >: -i aatttcttgg ggtgat 


36 


<210> I-: 
< 2 1 1 > j * 

<2i2> dn;- 

<213> Ar\ . ■ t ■ 

<22C> 

<2 23> De/r; .:• : '..facial Sequence: Primer xyc3 7C 

<400> 14 

ttttcct::-. : .1.. rcaacatc agcaac 36 


<210> 15 
<211> 33 
<212> DMA ' 

< 2 1 3 > Ar t i t : ir •..* :r.:«.* r.c c 

<220> 

<223> Descr i pt : :: Artificial Sequence: Primer a) 


<210> 16 
<211> 32 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer b) 
<400> 16 

ggactagttt aaccaacatc agcaacagta at 32 

<210> 17 
<211> 89 
<212> DMA 

<2 13 > Artificial Sequence 


<400> 15 

ccggaattca tgTttgnttt tactccagtt get 


33 


<220> 
<223> 


Description of Artificial- Sequence Primer- Al 


<40G> 


17 


BNSDOCID: <WO 0166693A1 I > 
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! C CJC 3 3. ~ 3 _ 3l t C a 1 1 a 

acc caq 


— — i— \- — <— w — — w ^ '-^ s_ --^ — < — I >_ ■S--J^ , _C»-<ii C v_ 


tgr ctgcgg 


gttc tgctggctc 


89 


<210> IE 

< 2 1 1 > 3 5 

< 2 1 2 > DMA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer El. 
<400> IS 

ctagactagt ctagtcaact aaagcactct cccca 35 


<210> 19 
<211> 1393 
<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pichia 

methanol ica codon optimized phytase encoding 
secuence . 


<400> 19 

atgattactt 

gccccttctt 

cctgctactt 

tccgtttcat 

catggtgcta 

atccaagcca 

actttaggtg 

aagttctacc 

ggttcagaca 

ttagctgatc 

tccgaaactt 

ttaggtgatg 

gccgaaaagc 

atgtgttcat 

caattattca 

tacggttacg 

tcaattgcca 

gtctccaacc 

aactccatgg 

agaacctcag 

ttcggtgcta 

agagctttga 

agatgtaagt 

ggtgaatgtt 


taact ttctt 
ctgctggttc 
ctcatctatg 
ctaagttacc 
gatacccaac 
acgccaccga 
ctgatgactt 
aaaggtacaa 
gagttattgc 
ctggtgctac 
tcaacaacac 
aagt tgctgc 
atttacctgg 
ttgatactgt 
ctcacaacga 
gtgccggtaa 
gattgactcg 
cagccacctt 
tttccatctt 
ttgaatccgc 
gagcctactt 
ttaacgacag 
taaacgactt 
tttcttga 


attatcagct 
caagtcctgt 
gggtt tatac 
aaaggattgt 
ctcctccaag 
cttcaagggt 
aactccatt t 
ggctttagct 
ttcaggtgaa 
taacagagcc 
tttagaccac 
caacttcact 
tgttacttta 
tgctagaacc 
atggaagaag 
ccctttaggt 
ttcaccagtt 
cccattgaac 
ctttgccttg 
caaggaattg 
cgaaactatg 
agttgttcca 
tgtcaagggt 


get tatttat 
gatactgttg 
tcaccattct 
agaatcacct 
tccaaaaagt 
aagtttgcct 
ggtgaacaac 
agatctgttg 
aagttcatcg 
gctccagcta 
ggtgtttgta 
gctttatttg 
acagacgaag 
tccgacgcct 
tacaactact 
ccagctcaag 
caagaccaca 
gctaccatgt 
ggtttataca 
gatggttatt 
caatgtaagt 
ttacatggtt 
tegtcttggg 


tatctggtag 
acttaggtta 
tttcattaga 
tggttcaagt 
ataagaagtt. 
ttttgaagac 
aattagttaa 
ttccatttat 
aaggtttcca 
tttctgttat 
ctaagtttga 
ccccagacat 
aegtegttte 
ctcaattatc 
tacaatcctt 
gtattggttt 
cctccactaa 
aegtcgaett 
aeggtactga 
ctgcctcctg 
cagaaaagga 
gtgatgttga 
ccagatctgg 


agtttctgct 60 
ccaatgttcc 120 
agacgaatta 180 
tctatcaaga 240 
agttactgct 300 
ttacaactat 360 
ctcaggtatc 420 
tagagectea 480 
acaagctaag 540 
tattccagaa 600 
agcttctcaa 660 
cagagctaga 720 
tctaatggac 780 
accattctgt 840 
gggtaagtac 900 
caccaacgaa 960 
ctcaactcta 1020 
ttcacacgac 1080, 
accattgtcc 1140 
ggttgttcct 1200 
acctttagtt 1260 
caagttaggt 1320 
tggtaactgg 13 30 

1398 


<210> 20 

<211> 40 

<212> DNA 

"<213> Artificial Seauence 


<220> 
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* 


I 
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<2 23> Description cf Artificial Sequence: ?r inner phyl . 
<400> 20 

atig3C.u3.CIl~ taaCtttCI" 3CCSC CSgCt CCCCciCCtaC 4 o 


<210> 21 
<211> 4C 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy2r 
<400> 21 

agcagaaact ctaccagaca ataaataagc agctgataat 40 


<210> 22 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy3 
<400> 22 

tatctggtag agtttctgct gccccttctt ctgctggttc 40 


<210> 23 
<211> 40 ■ 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Primer phy4r 
<400> 23 

caacagtatc acaggacttg gaaccagcag aagaaggggc 4 0 


<210> 24 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy5 
<400> 24 

caagtcctgt gatactgttg acttaggtta ccaatgttcc 40 


<210> 25 
<211> 40 

<212> DNA 

<213> Artificial Sequence 


BNSDOCID: <WO 0166693A1 I > 
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z : Artificial Secuence : Primer chv6r 


< 4 C 0 .* - 


i" ; 'jvaaca ttgg caacccaagt. 


40 


<210> 

<21X> 4_ 

<212> z::-\ 

< 2 1 3 > Ar: . : • ■ q uence 

<220> 

<223> L-:.*. ~ r : r : ; - r. \ : Artificial Sequence: Primer phy7 

<: 4 0 0 > ZC 

cctgcta:: * r* i* : v _ 7 "?- ttatac tcaccattct 4 0 

<210> 27 - 

<211> 4C 

<212> DNA 

<213> Ar: : : : : »1 

<220> 

<223> Desc::p:::r. Artificial Sequence.: Primer phySr 


<210> 28 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy9 
<400> 28 

tttcattaga agacgaatta tccgtttcat ctaagttacc 40 

<210> 29 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phylOr 


<400> 27 

taattcgtct r. ctaa t a.ia i ariaatggtga gtataaaccc 


40 


<400> 29 

aggtgattct acaatccttt ggtaacttag atgaaacgga 


40 


<210> 30 

<211>- 40 — - — 

<212> DNA 

<213> Artificial Sequence 


( 


4 
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< 2 2 0 > 

<222> Description of Artificial Sequence: Primer phyll 

< 4 0 C > 3 0 

aaaggatcgt agaatcacct tggttcaagt :c:accaag£ 40 


< 2 1 0 > 3 1 

c21i> 4C . ' 

<212> DNA 

<2I3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phyl2r 
<40C> 31 

gttgggtatc tagcaccacg tcttgataga acttgaacca 40 


<21C> 32 
<2il> 4 0 
<212> DNA 

<2I3> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: Primer phyl3 
<400> 32 

catggtgcta gatacccaac ctcctccaag tccaaaaagt 40 


<210> 33 
<211> 40 
<2I2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phyl4r 
<400> 33 

agcagtaact aacttcttat actttttgga cttggaggag . - 40 


<210> 34 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phyl5 
<400> 34 

ataagaagtt agttactgct atccaagcca acgccaccga 40 


-<210>-35~ 
<2il> 40 
<212> DNA 


BNSDOCID: <WO 0166693A1 I > 


WO 01/66693 PCT/DK01/00154 


<213> Artificial Secuence 


<220> 

<223> Description of Artificial Sequence-. Primer phy!6r 
<400> 35 

aggcaaacc: accctcgaag tcggtggcgt tggcttggac 40 


<2iC> 36 
<2I1> 40 
<2I2> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phyl7 
<400> 36 

cttcaagggt aagtttgcct ttttgaagac ttacaactat . 40 


<210> 37 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phylSr 
<400> 37 

aagtcatcag cacctaaagt atagttgtaa gtcttcaaaa 40 


<210> 38 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phyl9 
<400> 38 

actttaggtg ctgatgactt aactccattt ggtgaacaac 40 


<210> 39 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy20r 
<400> 39 

gatacctgag ttaactaatt gttgttcacc aaatggagtt 40 


<210> 40 
<211> 40 


BNSDOCID: <WO 0166693A1 I > 


WO 01/66693 PCT/DKO 1/001 54 

10 

<212> DNA 

< 2 13 > Artificial Sequence 
<220> 

<223 > Description of Artificial Sequence: Primer phy2 1 
<400> 40 

aattagttaa ctcaggtatc aac:tctacc aaaggtacaa 40 


<210> 41 
<2il> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy23 
<400> 41 

caacagatct agctaaagcc ttgtaccttt ggtagaactt 40 


<210> 42 
<21i> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy23 
<400> 42 

ggctttagct agatctgttg ttccatttat tagagcctca 40 


<210> 43 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy24r 
<400> 43 

gcaataactc tgtctgaacc tgaggctcta ataaatggaa 40 


<210> 44 
<2ll> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy25 
<400> 44 

ggttcagaca gagttattgc ttcaggtgaa aagttcatcg 40 


<210> 45 


BNSDOCID: <WO 0166693A1_I_> 


WO 01/66693 PCT/DKO 1/001 

1 1 

<21 1 > 4C 
<212> DNA 

< 2 1 3 > Artificial Seauence 


<220> 

<222> Description of Artificial Sequence: Primer phy26r 
<400> 45 

cttagcctgt tggaaacctt cgatgaactt ttcacctgaa 40 


<210> 46 
<211> 40 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: Primer phy27 
<400> 46 

aaggtttcca acaagctaag ttagctgatc ctggtgctac 40 


<210> 47 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy28r 
<400> 47 

tagctggagc ggctctgtta gtagcaccag gatcagctaa 40 

<210> 48 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy29 
<400> 48 

taacagagcc gctccagcta tttctgttat tattccagaa 40 

<210> 49 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy30r 
<400> 49 

9tgttgttga~a"agttX'c'gga^ttctggaata" ataacagaaa" '"" ----- - 4 - 


BNStX)CID: <WO 0166693A1 I > 


WO 01/66693 
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<2I0> 5C 
<2II> 4G 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy3 I 
<400> 50 

tccgaaacnt tcaacaacac tttagaccac ggtgtttgta 40 


<210> 51 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy32r 
<400> 51 

ttgagaagct tcaaacttag tacaaacacc gtggtctaaa 40 


<210> 52 
<211> 40 - 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy33 
<400> 52 

ctaagtttga agcttctcaa ttaggtgatg aagttgctgc 4 0 


<210> 53 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy34r 
<400> 53 

caaataaagc agtgaagttg gcagcaactt catcacctaa 40 


<210> 54 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy35 

■<400>-54 ----- — - 


caacttcact gctttatttg ccccagacat cagagctaga 40 


BNSDOCID: <WO 0166693A1 I 


WO 01/66693 PCT/DK01/00J54 


<210> 55 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy36r 
<400> 55 

ccaggtaaat gcttttcggc tctagctctg atgtctgggg 40 


<210> 56 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy37 
<400> 56 

gccgaaaagc atttacctgg tgttacttta acagacgaag 40 


<210> 57 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy38r 
<400> 57 

gtccattaga gaaacgacgt cttcgtctgt taaagtaaca 40 

<210> 58 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy39 
<400=> 58 

acgtcgtttc tctaatggac atgtgttcat ttgatactgt 4 0 

<210> 59 
<211> 40 • 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy40r 

<400> 59 

a 99 c 9tcgga ggttctagca acagtatcaa atgaacacat 40 


BNSDOCID: <WO 0166fiQ3A1 I > 


WO 01/66693 
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0> 


50 


40 


< 2 1 2 > DNA 

<2 1 3 > Artificial Sequence 
<220> 

<223 > Description cf Artificial Sequence: Primer phy41 
<400> 60 

tgctagaacc tccgacgcct ctcaattatc accattctgt 


<210> 61 
<211^ 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy42r 
<400> 61 

tcgttgtgag tgaataattg acagaatggt gataattgag 


<210> 62 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy43 

<400> 62 

caattattca ctcacaacga atggaagaag tacaactact 


<210> 63 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy44r 
<400> 63 

gtacttaccc aaggattgta agtagttgta cttcttccat 


<210> 64 
<211> 40 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description- of— Ar-t-i-ficial- Sequence Primer phy45' 

<400> 64 


BNSDOCID: <WQ 
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:a:aaccct; gggtaagcac -acggttacg gcgccggtaa 


<220> ' ■ — - 

<223> Description of Artificial Sequence: Primer phySOr 


<2 1 C > c: 
<211> 40 
<2I2> DNA 

<21 3 > Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy46r 
<4 00 > 55 

cttgagctgg acctaaaggg ttaccggcac cgtaaccgta 40 


<210> 66 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy47 
<400> 66 

ccctttaggt ccagctcaag gtattggttt caccaacgaa 4 0 


<210> 67 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> -Description of Artificial Sequence: Primer phy48r 
<400> 67 

cgagtcaatc tggcaattaa ttcgttggtg aaaccaatac 4 0 


<210> 68 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy49 
<400> 63 

ttaattgcca gattgactcg ttcaccagtt caagaccaca 40 


<210> 69 
<211> 40 
<212> DNA 

<213> Artificial Sequence 


BNSDOCID: <WO 01fifififi*Ai I 
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tgtggtcttg aactcgtgaa 


< ^' _ » . • 

< 2 1 i > *i 

< ^ - - > 

< 2 1 2 > /. ; 


<22C> 
<22 2 > 


: :' Artificial Sequence: Primer phySl 


<400> 
ccLccar*. 


ntccccaacc cagccacctt 


40 


<210> 7; 

<211> 4 1- 

<212> Di."/- 

<213> An 


<220> 

<22 3> Des:: : : 


iificial Sequence: Primer phy52r 


<400> 71 
acatggtag 


-■.-3"".' .la.^gtggctg ggttggagac 


40 


<210> 72 
<211> 40 
<212> DNA 

<213> Arnifiri.il sequence 
<220> 

c223> Descript :cn of Artificial Sequence: Primer phy53 
c400> 72 

cccattgaac gctar~atgt acgtcgactt ttcacacgac 


40 


<210> 73 
<211> 40 
<212> DNA 

<213> 'Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy54r 


<400> 73 

aagatggaaa ccatggagtt gtcgtgtgaa aagtcgacgt 


40 


<210> 74 
<211> 40 
<212> DNA 

c213> Artificial Seauence 


<220> 

<223> Description of Artificial Sequence: Primer phy55 


^\ BNSDOCID: <WO 0166693A 1_l_> 
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<400> "4 

aactccatgg t t tcca.zczz ctttgcctcc cgtt cataca 40 

<210> 75 
<211> 40 
<2I2> DMA 

<2I3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy56r 
<400> 75 

ggacaatggt tcagtaccgt tgtataaacc caaggcaaag 40 

<210> 76 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy57 
<400> 76 

acggtactga accattgtcc agaacctcag. ttgaatccgc 40 

<210> 77 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy58r 
<400> 77 

aataaccatc caattccttg gcggattcaa ctgaggttct 40 

<210> 78 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy59 
<400> 78 

caaggaattg gatggttatt ctgcctcctg ggttgttcct 40 

<210> 79 
<211> 40 
<212> DNA 

<2 13 >- -Artif icial- Sequence 

<220> 


BNSDOCID: <WO 0166693A1 I > 


WO 01/66693 


PCT/DK01/00154 


IS 


cr:o: ion cf Artificial Sea 


uence 


Prime 


r chv6 Or 


<400> 79 
aagtaggctc 


tag 



ca 39 


aggcag 


40 


<210> SO 
<211> 40 
<212> DNA 

<2 13 > Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy61 
<400> 80 

ttcggtgcta gagcctactt cgaaactatg caatgtaagt 4 0 

r 

<210> 81 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy62r 


<210> 82 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy63 
<400> 82 

cagaaaagga acctttagtt agagctttga ttaacgacag 4 0 

<210> 83 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer phy64r 


<400> 81 

aactaaaggt tccttttctg acttacattg catagtttcg 


40 


<400> 83 

aaccatgtaa tggaacaact ctgtcgttaa tcaaagctct 


40 


<210> 84 
<211> 40 

<212> DNA - - 

<213> Artificial Sequence 


BNSDCCID: <WO 0166693A1J > 
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c2M > 

< 2 2 :• > 


< - r r . 


cificial Sequence: Primer phy6 5 


.;:ga:c:cga caagttaggc 


40 


<210> zi 
<211> 


21 


<213> A : *. 


' *3 Tt A 


<220> 

<223> D*. . : r . : ■ . :. : Artificial Sequence: Primer phy66r 


<400> 8E 
aagtcgt zz .» ... - *. 


* arrtaacttg tcaacatcac 


40 


<210> 8€ 
<211> 40 
<212> DMA 
<213> Art: 


<220> 

<223> Desrr.ri: r. Artificial Sequence: Primer phy67 
<400> 86 

agatgtaagt c i/Kic^a::: tgtcaagggt ttgtcttggg 


40 


<210> 87 
<211> 40 
<212> DNA 

<213> Artificial Soauence 


<220> 

<223> Description of Artificial Sequence: Primer phy68r 
<400> 87 

ccagttacca ccagatctgg cccaagacaa acccttgaca 


40 


<210> 88 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence: Primer phy69 
<400> 88 

ccagatctgg tggtaactgg ggtgaatgtt tttcttga 


38 


<210> 89 

-<-2-ll>- 18 

<212> DNA 

<213> Artificial Sequence 
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PCT/DK01/00I54 


<220> 

<223 > Description cf Artificial Sequence: Primer phy70r 
<40C> 85 

icaagaaaaa cattcacc 18 

< 2 1 0 > 9C 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Descriotion of Artificial Seauence: Primer c) 
<400> 90 

cggaattcat gattacttta actttcttat tatcagctta ttta 44 


<210> 91 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer d) 
<400> 91 

ggactagttc aagaaaaaca ttcacccc 2 8 
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